Bilingual Corpus – Digital Repository for Preservation of Language Heritage
نویسندگان
چکیده
The article briefly reviews bilingual Slovak-Bulgarian/BulgarianSlovak parallel and aligned corpus. The corpus is collected and developed as results of the collaboration in the frameworks of the joint research project between Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, and Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences. The multilingual corpora are large repositories of language data with an important role in preserving and supporting the world's cultural heritage, because the natural language is an outstanding part of the human cultural values and collective memory, and a bridge between cultures. This bilingual corpus will be widely applicable to the contrastive studies of the both Slavic languages, will also be useful resource for language engineering research and development, especially in machine translation.
منابع مشابه
Bilingual children and adult heritage speakers: The range of comparison
This paper compares the language of child bilinguals and adult unbalanced bilinguals (heritage speakers) against that of bilingual native speakers of their home language (baseline). We identify four major vectors of correspondence across the language spoken by these three groups. First, all varieties may represent a given linguistic property in a similar way (child bilinguals = adult heritage s...
متن کاملTone restoration in transcribed Kammu: Decision-list word sense disambiguation for an unwritten language
The RWAAI (Repository and Workspace for Austroasiatic Intangible heritage) project aims at building a digital archive out of existing legacy data from the Austroasiatic language family. One aspect of the project is the preservation of analogue legacy data. In this context, we have at our hands a large number of mostly-phonemic transcriptions of narrative monologues, often with accompanying soun...
متن کاملRelational Database Preservation through XML modelling
Digital Archives are complex structures composed of human resources, state of the art technologies, policies and data. Due to the heritage keeping role that archives assume in our society, it is important to make sure that, the data that is produced by our organizations is preserved accordingly in order do document is activity and provide evidence of their activities. Information stored in an a...
متن کاملPreservation Planning: A Comparison Between Two Implementations
This paper examines preservation planning as it is implemented within the National Library’s preservation repository (Rosetta) and compares it directly to the PLATO tool created as part of the PLANETS project. Preservation planning is both a business precondition and the systematic framework defining any preservation action. At the National Library of New Zealand Te Puna Mātauranga o Aotearoa, ...
متن کاملCulture Heritage Digital Repositories. Research Questions
This discussion is about innovative solutions for assembling multimedia digital repositories for collaborative use in specific contexts and communities and enhancing scholarly understanding and experiences of digital cultural heritage. Several aspects are stress such as the dynamic aggregation of cross-media resources across existing institutional digital libraries and repositories. Research qu...
متن کامل